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Abstract 

Radio-frequency (RF) impairments, which intimately exist in wireless communication systems, can 
severely limit the performance of multiple-input multiple-output (MIMO) systems. Although we can 
resort to compensation schemes to mitigate part of these impairments, a certain amount of residual 
impairments always persists. In this paper, we consider a training-based point-to-point MIMO system 
with residual transmit RF impairments (RTRI) using spatial multiplexing transmission. Specifically, we 
derive a new linear channel estimator for the proposed model, and show that RTRI create an estimation 
error floor in the high signal-to-noise ratio (SNR) regime. Moreover, we derive closed-form expressions 
for the signal-to-noise-and-interference ratio (SINR) distributions, along with analytical expressions for 
the ergodic achievable rates of zero-forcing, maximum ratio combining, and minimum mean-squared 
error receivers, respectively. In addition, we optimize the ergodic achievable rates with respect to the 
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training sequence length, and demonstrate that finite dimensional systems with RTRl generally require 
more training at high SNRs than those with ideal hardware. At last, we extend our analysis to large-scale 
MIMO configurations, and derive deterministic equivalents of the ergodic achievable rates. It is shown 
that, by deploying large receive antenna arrays, the extra training requirements due to RTRI can be 
eliminated. In fact, with sufficiently large number of receive antennas, systems with RTRI may even 
need less training than systems with ideal hardware. 

Index Terms 


Hardware impairments, large-scale MIMO, pilot optimization, random matrix theory, training-based 
channel estimation 

1. Introduction 

Point-to-point multiple-input multiple-output (MIMO) systems offer wireless eommunieation 
with high data rates, without requiring additional bandwidth or transmit power. The pioneering 
works of Q, [[^ demonstrated a linear growth in capaeity in rich scattering environments by 
deploying multiple antennas at both the transmitter and receiver sides. However, to fully reap 
the advantages that MIMO systems can offer, perfect instantaneous channel state information 
(CSI) is essential, especially at the receiver. 

In many practical systems, a training-based (or pilot-based) transmission scheme is usually 
adopted to acquire CSI. In such systems, the transmitter sends a block of symbols which contain 
both pilot and data information. The receiver estimates the instantaneous channel realization 
and uses the acquired CSI to retrieve the intended data. Under ideal radio-frequency (RF) 
hardware assumptions, channel estimation is a well investigated topic (e.g., 0-0); however, 
these assumptions are quite unrealistic in practice. RF impairments, such as in-phase/quadrature- 
phase (FQ) imbalance, high power amplifier non-linearities, and oscillator phase noise, are known 
to have a detrimental impact on practical MIMO systems (D -p3||. The influence of such 
individual hardware impairments is usually mitigated by using analog and digital signal pro¬ 


cessing algorithms [10|. However, these techniques are not able to completely remove hardware 


impairments, such that a certain amount of residual distortions always remains. These residual 
impairments stem from, for example, time-varying hardware characteristics which cannot be 
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accurately parameterized and estimated, as well as, the randomness induced by different types 
of noise and imperfect compensation schemes. 


The statistical properties of residual impairments have been investigated in a number of existing 
works. The authors in (D, o characterized and verified experimentally that the residual 
distortion caused by the impairments behaves as additive and independent Gaussian noise. This 
Gaussian behavior can be understood by the central limit theorem, when the distortions from 
many independent and different sources add up together. In particular, for Gaussian input, one can 


interpret the above observation using the Bussgang Theorem [14|, which states that the output 
process of a Gaussian input process through a non-ideal hardware is the sum of a scaled input 
process and a distortion process which is uncorrected with the input process p3| , p^ . The very 
recent paper 03 has compared the above additive Gaussian model with a sophisticated hardware 
impairment model in massive MIMO systems, which comprised several main sources of hardware 
impairments. Their results verified that the above Gaussian approximation is sufficiently accurate 
despite of its simplicity. 


The impact of residual hardware impairments has been only scarcely investigated in few works 


recently. In [ 181-[22|, the impact of residual RF impairments was studied under different system 
scenarios, with a few important conclusions being drawn. For example, [18|, p9| showed that 
impairments fundamentally limit the channel capacity in the high signal-to-noise (SNR) regime. 


while for low transmit power, such impact is negligible. Moreover, [19| has shown that by 
deploying large antenna arrays at both the transmitter and receiver, the influence of impairments 
can be alleviated. Nonetheless, these results hold only for optimal receivers which induce high 
computational complexity, and are not of particular interest for larger-scale MIMO topologies. In 
[ |22| , the authors claimed that the channel capacity is mainly limited by the residual impairments 
at the user terminals, and also proved that large-scale MIMO systems were able to tolerate 
larger hardware impairments than conventional small-scale MIMO systems. The authors therein 
also showed a few limiting effects induced by impairments in the large-antenna regime, such as 
estimation accuracy and capacity ceiling; however, they did not pursue any resource allocation 


March 31, 2015 


DRAFT 







4 


analysis, which is of pivotal importance for the performanee optimization of training-based 
MIMO systems. 

Motivated by the above mentioned limitations, in this paper, we eonsider a training-based 
MIMO system affeeted by residual transmit RF impairments (RTRI). Our main goal is to 
investigate the impaet of RTRI on the rates that are aehievable with different linear reeeivers, as 
well as to find the optimal training length. More speeifieally, our main eontributions are: 


• We derive a linear minimum mean-squared error (LMMSE) ehannel estimator of the ehannel 
with optimized training matrix strueture. We analytieally show that RTRI introduee an 
irredueible estimation error floor in the high SNR regime. 


We derive elosed-form expressions whieh tightly approximate the SINR distributions (outage 
probabilities) and ergodie aehievable rates for zero-foreing (ZF), maximum ratio eombining 


(MRC), and minimum mean-squared error (MMSE) reeeivers. In eontrast to [22|, these 
results hold for any finite number of antennas. We observe aehievable rate eeilings in the 
high SNR regime, whieh are analytieally quantified, while in the low SNR regime, we show 
that the impaet of RTRI vanishes. Moreover, we demonstrate that MRC reeeivers are mueh 
more resilient to hardware impairments than ZF and MMSE reeeivers. 

We find the optimal training length of MIMO systems with RTRI. We show that, if ZE 
and MMSE reeeivers are adopted, the optimal training length ean be mueh larger than that 
of systems with ideal hardware. In general, higher levels of RTRI impose more training 
requirements. 

Einally, we extend our analysis to a general large-seale MIMO seenario. With the help 
of random matrix theory, we derive deterministic equivalents of the ergodie aehievable 
rates. The optimal training length is thereafter found by solving an asymptotieally optimal 
eonvex optimization problem. An interesting observation is that with large number of reeeive 
antennas, systems with RTRI may need even less training than ideal hardware systems. 


The paper is struetured as follows: The system model, ineluding ehannel training and data 
transmission, is deseribed in Seetion |I^ The estimation phase is analyzed in Seetion III In 
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Section we derive the achievable rates of different linear receivers, along with the optimal 
training length. We extend our analysis to large-scale MIMO systems in Section |Vj Finally, 
conclusions are drawn in Section 

Notation: Upper and lower case boldface letters denote matrices and vectors, respectively. 
The trace of a matrix is expressed by tr(-). The n x n identity matrix is represented by I„. 
The expectation operation is E[-], while the matrix determinant is denoted by det(-). The super¬ 
scripts (■)'^, (■)“^ and (■)'^ stand for Hermitian transposition, matrix inverse and pseudo-inverse, 
respectively. The Frobenius norm and spectral norm are denoted by H-H^ and 11-112, respectively. 
The symbol CAf (m, S) denotes a multi-variate circularly-symmetric complex Gaussian (CSCG) 
distribution with mean m and covariance S. For any matrix H G hj is the Uth column 

of H. 


II. Signal and System Models 

We consider a block fading channel with coherence length T. During each block, the channel 
is constant, and is a realization of the uncorrelated Rayleigh fading model. Channel realizations 
between different blocks are assumed to be independent. 

A. System Model With RTRI 

RF impairments exist inherently in practical wireless communication systems. Due to these 
impairments, the transmitted signal will be distorted during the transmission processing, thereby 
causing a mismatch between the intended signal and what is actually transmitted. Even though 
compensation schemes are usually adopted to mitigate the effects of these impairments, there 
is always some amount of residual impairments. In p0| , pT| , the authors have shown that 
these residual impairments on the transmit side act as additive noise. Furthermore, experimental 
results in pT] | revealed that such RTRI behave like zero-mean complex Gaussian noise, but with 
the important property that their average power is proportional to the average signal power. For 
sufficient decoupling between different RF chains, such impairments are statistically independent 
across the antennas. Moreover, impairments during different channel uses are also assumed to 
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be independent!^ We now denote the RTRI noise as A. Then, the input-output relationship of a 
training-based MIMO system with Nt transmit antennas and Nr receive antennas, within a block 
of T symbols, can be expressed as 

Y=./^H(S + A)+V, (1) 

V 

where S G is the transmitted signal containing both pilot symbols and data symbols]^ 

and H G is the channel matrix. The receiver noise and the received signal are denoted 

as V G and Y G respectively. Each element of H and V follows a C7V(0,1) 

distribution. The average SNR at each receive antenna is denoted by p. At last, according to the 
above discussion, we can characterize the RTRI noise A according to 

6i ~ CA/" (O, ,E [Sidf] = 0 i,j = 1,2,... ,T,i j, 


where <5* denotes the ith column of A, and the proportionality parameter S characterizes the 
level of residual impairments in the transmitter. Note that S appears in practical applications 
as the error vector magnitude (EVM) p4|, which is commonly used to measure the quality of 


RE transceivers. Eor instance, 3GPP ETE has EVM requirements in the range [0.08, 0.175] [24|, 
where smaller EVMs are required to support high rates. The relationship between 6 and EVM 
is defined as 

. Ik. niAiill 

= ( 5 . 


EVM = 


( 2 ) 


Es[iisiiq 

Evidently, when 5 = 0, the system model simplifies to the ideal hardware case. The EVM can be 


measured in practice using well-established experiment setups, e.g. p5| . We can now decompose 
the system model in ([^ into the training and data transmission phases as follows: 


'Residual impairments refer to the aggregate noise that is left after properly compensating the hardware-impaired transmit 
signals in each transmission step. Since it is conventionally assumed that the transmitted symbols across different channel uses 
are independent, it is also reasonable to assume that the residual impairments are also independent over each symbol. The 
paper [11] provides experimental justification of this assumption. More generally speaking (not restricted to RTRI), the effects 
of transceiver distortions, such as clipping in power amplifier nonlinearities, quantization errors, can be well-approximated as 
memoryless functions [16], thus they are also independent over symbols/time. 

^Note that we choose S to be full rank, i.e., all Nt antennas are used to transmit signals. Although this is inefficient for the 
low SNR regime, it is a meaningful scheme when no CSI at the transmitter (CSIT) is available |23|. 
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1) Training Phase: 


Yp= ^/|.H(S, + A,)+Vp, tr (SpSf) = iV*T„ 


(3) 


where Sp e is the deterministie matrix of training sequenees sent over Tp ehannel uses, 

that is known by the receiver, Yp is the x Tp received matrix. The distortion noise caused 
by the RTRI is characterized as 


(5p^~CAr(0,52i^,),E 




= 0, i,j = 1,2,..., Tp, 7 ^ j. 


(4) 


where 5p. denotes the iih. column of Ap. 


2) Data Transmission Phase: 


Y, = J|.H(Sd + A,)+V,, E[tr(SfS,) 


= N,T, 


tJ-d, 


(5) 


where G is the m3.trix of ci3.t3- symbols sent over chnnnel uses, the x Td 

received signal matrix. The distortion noise caused by RTRI is characterized as 


6di ~ CV (O, ,E [SdiSdf] =0, i, j = 1,2,... ,Td,i h 


( 6 ) 


where 6di denotes the zth column of A^. 

Note that S = [Sp S^] and A = [Ap A^], and the conservation of coherence block length 
yields T = Tp + T^. 

Remark 1: We only consider the RTRI at the transmitter side since it is more dominant than 
its counterpart at the receiver. In practical communication systems, the received signal will firstly 
pass through a low noise amplifier (LNA), such that the post-processed power for the rest of the 
receiver circuit (e.g. demodulator, mixer) is always within a certain range, no matter how much 
power is actually captured by the antenna. Moreover, even if we model the receiver residual 
impairments in the same way as the RTRI, i.e., its variance is proportional to the received power 
that is received by the antenna, it is still less dominant in some practically interesting cases; for 


example, low SNR applications and large-scale MIMO with many receive antennas [19|. 
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III. Channel Estimation 

In this section, we analyze the impact of RTRI on the training phase. Our objective is to 
characterize how estimation accuracy is affected by RTRI. Conditioned on the availability of 
RTRI distortion distribution at the receiver, the current channel realization within each block 
can be estimated through the received signal Yp. The received signal contains both Gaussian 
distributed terms HSp and Vp and double Gaussian distributed term HAp; therefore, closed-form 
MMSE estimator is difficult to determine. Thus, we resort to EMMSE estimator, whose closed- 
form expression can be derived through tractable manipulations, and which performs better than 
other non-Bayesian estimators, e.g., least square (ES) estimator. 

Given the signal model in Q, we can derive the EMMSE estimator by directly extending the 
classical estimation results 0 . which is given by 


H f Sf S, + (iV + 1) 


N, 


P cH 




-1 


on 

Sp . 


(7) 


Eor any given training symbol matrix Sp, Q minimizes the mean-squared error (MSE) from 


the true channel, which is defined as MSE = tr (C ), where C = E 


H^H 


represents the 


estimation error covariance matrix, and H = H — H is the estimation error. Eurthermore, using 
a similar approach as in Q, we find that the optimal training sequence which minimizes the 


MSE should satisfy SpS" = TplAr^, and the corresponding MSE reduces to 


MSE = ^ 


1 + e 


, with e = 


pTp 


Nt{p6^ + 1)- 


( 8 ) 


The resulting estimation error covariance matrix therefore becomes C = It is straightfor¬ 

ward to show that H has i.i.d. entries with zero mean. The variance of each element in H, which is 
also defined here as the normalized MSE (NMSE), can be expressed as = i^rWtr f C) = 


NrNt 


1+e' 


By the orthogonality principle of EMMSE estimates [26|, each element in H has a variance of 

2 


4 = 1 


(Tg = 4' If we examine at the expression of the NMSE, it is trivial to find that the 


estimator reaches an error floor at high SNRs, which is quantified by taking p —)■ oo as 


NMSEn^it = 


1 


1 + 


Tp • 
NtS^ 


(9) 
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Obviously, the value of this floor depends on the level of RTRI; in general, large RTRI will 
eause severe degradation of the ehannel estimates. We ean also see from Q that, for a fixed 
level of RTRI, an inerease in the training sequenee length Tp deereases the MSB monotonieally. 

Figure shows the NMSE, cr^, of a MIMO system with = 4 for different levels of 

impairments. In this ease, we use Tp = 4 ehannel uses to transmit pilot symbols, whieh is the 
minimum length required to estimate all ehannel dimensions. Without the existenee of RTRI, 
inereasing the transmit power deereases the NMSE monotonieally towards zero. However, in 
the presenee of RTRI, we observe a fundamentally different behavior. Speoifieally, when the 
transmit power beeomes high, impairments will generate an irredueible error floor, whieh was 
explieitly quantified in Q. 



Fig. 1. Normalized mean-squared error (MSE) for different levels of impairments {Nt = Nr = 4, T = 100, Tp = 4). 

IV. Data Transmission 

In this seetion, we investigate the data transmission of a training-based MIMO system with 
spatial multiplexing seheme. We seek to analyze the impaet of RTRI on the ergodie aehievable 
rates of three most eommon linear reeeivers, namely ZF, MRC and MMSE reeeivers, and to 
find the optimal training length that maximizes the eorresponding ergodie aehievable rates. 
Speeifieally, for a given eoherenee time T, we ean formulate the following optimization problem: 

maximize R 

subjeet to Nt < Tp < T, (10) 
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where R is the ergodie aehievable rate parameterized by Tp. We derive SINK distributions for 
eaeh type of reeeiver to evaluate R. The optimal training length T* is found through a line 


seareh over all possible Tp values that satisfy the eonditions in (10). In the following, we pursue 
our analysis for exaet SNRs, as well as, high and low SNRs. 

A. Exact SNR analysis 

We start by writing the x 1 reeeived veetor through Q 


= W ( Hs, + H6d)+ 


= , —hkSdk + -i — 

y Nt ^ y Nt 


Nt 


hiSdi + Usd + H<5d 1 + Vd, k = 1,2,...,Nt 


( 11 ) 

( 12 ) 


— "^dk 


where Sdk is the transmitted data symbol on the fc-th transmit antenna, and Zdk is the total 
effeetive noise plus interferenee on the fc-th spatial stream. To reeover the signal is 

weighted by a 1 x veetor a|^, whieh for ZF, MRC and MMSE linear reeeivers is respeetively 
given by 





for ZF 



for MRC 

(13) 


for MMSE 



where gfc is the fc-th eolumn of G, with G = Hi = H ^H^Hj , and = E z^^z^lH 
is the eovarianee matrix of z^^, whieh is explieitly given by 


-1 


P 


Nt 


= h.hf + ^HH" + 


p + + 1 + e 

1 + 6 


Nr 


(14) 


Multiplying the reeeived signal with the weighting veetor, and eonditioned on that is known 
at the reeeiver, the SINR of ZF, MRC and MMSE reeeivers on the fc-th spatial stream are 
respeetively given as follows 

1 


7fc,ZF = 


5^ + co (H^H) 


-1 


(15) 


J 
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7A:,MRC — ;=^ 


Ihfhfcp 


h,|2 + 52 h,|2 + colhf h 


^ Aft ^ \ “^ 

7fc,MMSE = I E hthf + 52^ hjh^ + CqItv,. I hfc, 

\i=l,i^k i=l / 


(16) 


(17) 


where we have defined cq = + 1 +^) notational convenience, and hj denotes the zth 

column of H. We also define H = -^H, which has independent and approximately C7\/^(0,1) 

H 

entires]^ Note that for ZF receivers, we require iV^ > iV^, whilst we consider arbitrary antenna 
configurations for MRC and MMSE receivers. We now give the SINR cumulative distribution 
functions (CDFs) for these receivers in the following proposition, which, to best of our knowl¬ 
edge, are new. 

Proposition 1: For ZF, MRC and MMSE linear receivers, the CDE of the output SINR of an 
arbitrary spatial stream can be tightly approximated a^ 


^rzp(7) = 


TivIRC 


( 7 ) = 


■^EmMSB ( 7 ) 


C07 Nr-Nt ( <^01 

l_e 1=^ E 0<7<p, 


fc =0 


7 > 4, 


- ^ 0 ^ Nr-l k 


-1 ( 1 + 7 V ’ u ^ 7 ^ ^ 2 , 

, 1 — 5^'' 


fc=0 p=0 vi -52 


7 > 


-T^ Af.-l 


fc=o 


JVr —i / \ ft 

- E (i^) ’ 0 < 7 < ^, 

h—n \ ' / 


7 > 


(18) 


(19) 


( 20 ) 


2 \ 7 

A V p / V gp 
(fc-p)! 


where “ E 


V k-p ) V 1+^2 y 


p=max(0,A:—Vt + 1) 


^Due to the multiplicative term HAp, which appears in the estimated channel H, H is not exactly Gaussian distributed. 
However, we can verify easily through simulations that this it can be tightly approximated as Gaussian distributed even for high 
levels of RTRI. 

''Although our analytical results are based on the approximation mentioned in Footnote 3, we henceforth use equality signs 
without significant notational abuse. Note that our simulation results indicate that the rate approximations remain remarkably 
tight for a plethora of system parameters. 
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Proof: See Appendix ■ 

The above elosed-form expressions for the SINK CDF distributions ineorporate the statistieal 
impaet of RTRI, and most importantly, are valid for any finite number of transmit and reeeive 


antennas. Note that for <5 = 0, the above SINR expressions reduee to the ones in [271 with 


ideal hardware assumptions. Mathematieally speaking, (18)-(20) are equivalent to the outage 
probabilities]^ sinee 

^out(a:) = Pr{7 < x} = Ft{x). (21) 

Note that Proposition showeases the important faet that the outage probability is always 1 for 
7 > This implies that an SINR wall exists, whieh eannot be erossed by simply inereasing 
the transmit power. The value of this wall is inversely proportional to the square of the level of 
impairments. Note that similar eonelusions were also drawn in [ [20| in the eontext of dual-hop 
relaying. 

We now turn our attention to the ergodie aehievable rates, and invoke the following expression 

of the ergodie aehievable rate [j^ Lemma 1]: 

T, 


R=^E[Nt\og,{l + j)] 


TdNt 


1 — Ft ( 7 ) 


d7, 


( 22 ) 


ln(2)T Jq 1 + 7 

where 7 is found in ( [T5] ), ( [T^ and ( [TT] ) for ZF, MRC and MMSE reeeivers, respeetively, and 
the funetion Fr( 7 ) represents the CDF of 7 . Speeifieally, the eorresponding ergodie aehievable 
rates are given in the following theorem. 

Theorem 1: For training-based MIMO systems with RTRI, the ergodie aehievable rates of ZF, 
MRC and MMSE reeeivers ean be tightly approximated as 




Nr-Nt 




fe =0 


^k+1 


Co \ ^ f Co 


(23) 


^The outage probability per stream is always the lowest when transmitting only one stream, however, it renders a lower 
average throughput. Thus, in the following numerical analysis, we use all Nt antennas for capacity-approaching transmission. 
Note that, for the case which only < Nt antennas are used for transmission, we can evaluate the outage probability by 
substituting Nt = Nt into Proposition]^ 
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-Rmrc — 


TdNt 


Nr-l k 


ln(2)T 




fc =0 p =0 


X 


^ P+Nt / X p+jVt—i / nA 


(24) 


R 


TM 


Nr-l 


MMSE — ^ ^^T{k + 1) 


X 




Nt 


Co 


S2J 6^ (6^+1) 


Nt-i 


^ Nt-i + l,-k + Nt-iRl] 


Co 


<52 + 1 // ’ 

(25) 


where En{z) and r(n) are the exponential integral funetion [28, Eq. (8.211.1)] and the Gamma 


funetion [ |28| Eq. (8.310.1)], respeetively, while ^/(a, 6;x) is the regularized hypergeometrie 
funetion 


Eq. (9.210.2)]. 
Proof: See Appendix 


These ergodie aehievable rates are valid for arbitrary antenna eonfigurations, and ean be effi- 
eiently evaluated and easily implemented. Unfortunately, they offer limited physieal insights, 
and we, therefore, now elaborate on the low and high SNR regimes. 


B. Low SNR analysis, p —?• 0 


Eor some praetieal applieations (e.g. battery-limited users or sensors), low SNR is of partieular 
interest. Speeifieally, for ZE, MRC, and MMSE reeeivers, we have the following results. 


1) ZF receivers: As p —)■ 0, the ZE SINR beeomes 

Ty 


, p->0 _ 

7zf ~ 


N} 




- 


Consequently, we ean approximate the ergodie aehievable rate as below 


(26) 
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° = E 


TM 


( 


log 2 


1 + 


Tvp' 


\ 






-1 


k,k 


/J 


(a) Tp(T - Tp)p 


\n{2)TNi 


E 




. k,k_ 


W ~ Tp){Nr — Nt + 1) 2 

p •> 


(27) 


H2)TNt 

where (a) is obtained by using truneated Taylor’s expansion, and (6) is obtained by notieing that 


1/ (H^H) ^ follows a ehi-squared distribution with 2{Nr — Nt + 1) degrees of freedom 


k^k 


and is sealed by 1/2 p^ . 

2) MRC receivers: For very low transmit power, we ean remove negligible terms from ( [T^ , 
and arrive at the following MRC SINK expression 

I 

(28) 


p^o _ Tpp^\^^^k\ 
Tmrc ~ 


Nf ■ 

Through similar mathematieal manipulations, we get the aehievable rate of MRC reeeivers at 
low SNRs as 

0 ^ Tp{T - Tp)p^^ (29) 


ln( 2 )TAr, 


ln(2)TiVi 

where (a) is aehieved by using the faet that |h|^hfc| is a eomplex ehi-square distributed random 
variable with 2Nr degrees of freedom. 

3) MMSE receivers: For MMSE reeeivers, we first rewrite ( [T7] ) as 

-1 


(a) 

7mmse — 




-1 


(30) 


1 + h,hf + ] h, 

1 - 


( 6 ) 




CO 


-1 


k,k 


52 + 




Co 


-1 


(31) 


k,k 


where (a) follows by applying Lemma in Appendix A , while (b) is obtained by notieing 


that the quadratie form on the numerator of (30) is a elassie MMSE SINR expression with a 
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modified noise power, hence it equals 


1 


-Nt- 


1+^2 


H»H 


1 [30|. 


As p —0, we can easily find that the SINK and the ergodic 
following limits, respectively 


p^o _ Tpp'^l^k^kl 
7mmse ~ 


N? ’ 

dP-s-O _ Tp{T — Tp)Nr ^2 


ln(2)TiVt 


achievable rate approach the 

(32) 

(33) 


Comparing the ergodic achievable rates of different linear receivers in (21), (19) and (33), we find 
that the achievable rates have no dependence on <5^, i.e., the impact of RTRI has vanished in the 
low transmit power regime. Moreover, it can also be easily conjectured that the optimal training 
length always equals half of the coherence length, i.e., T* = T/2, such that as much as half of 
the coherence block should be devoted to channel training, which is in line with the result in 
0. More importantly, to maintain a certain level of performance, we can increase the number of 
receive antennas, while reducing the transmit power as fast as \/Nr and y/Nr — Nt + 1, for MRC 
(or MMSE) and ZF receivers, respectively. We can also achieve this power scaling by using less 
transmit antennas. Finally, as one would expect, MMSF and MRC receivers are equally good in 
the low SNR regime, while ZF receiver performs badly. 


C. High SNR analysis, p —)• oo 

From the expressions in ([2^-([25]), we can see that for a certain level of RTRI and system 
setup, the ergodic achievable rates only depend on the transmit power via the term cq. As p —)■ oo, 
Co —Co = <? (1+^ )Nt _ Therefore, the ergodic achievable rates of these linear receivers saturate 

Ip 

in the high SNR regime. We can easily quantify these rate limits by substituting cq = Cq back 


into the corresponding achievable rate expressions in (23)-(25). 


D. Numerical Illustrations 

In Fig. 1^ we plot the outage probabilities of linear receivers for different values of the SINR 
threshold. We consider two antenna configurations, i.e., Nt = Nr = 5 and Nt = 5, Nr = 30, at 
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p = 30dB. We observe that for a eertain SINK threshold, the outage probability of a hardware- 
impaired system is systematieally higher than that of an ideal-hardware system. More importantly, 
RTRI ereate a SINR wall, whieh eannot be erossed by inereasing the transmit power. On the 
other hand, for the ideal hardware ease, the outage probability eonverges smoothly to 1. However, 
inereasing the number of reeeive antennas ean make the outage probability approaeh the limit. 
When eomparing these reeeivers, we also find that MRC is mueh more resilient to hardware 
impairments than ZF and MMSE reeeivers, while ZF reeeivers are the most sensitive to hardware 
impairments. For instanee, for Nr = Nt = 5, the outage probability 10“^ of ZF reeeivers exhibits 
a power penalty of lldB, whilst for MMSE and MRC reeeivers this value is 4dB and OdB, 
respeetively. 



Fig. 2. Outage probability of linear receivers for different SINR threshold x. 


Figure [^illustrates the ergodie aehievable rates with ZF, MRC and MMSE linear reeeivers and 
optimized T*. The information-theoretie eapaeity, whieh ean be approaehed by optimal reeeivers 
(OR), are generated aeeording to our previous work 0 - These eurves provide upper bounds on 
the aehievable rates obtained by linear reeeivers. The OR strueture ean be a maximum-likelihood 
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receiver with joint decoding, or a MMSE receiver with successive interference cancellation 
(SIC). These curves are used as performance benchmarks. In particular, we consider a MIMO 
system with Nt = = A and coherence block T = 200. The theoretical curves are plotted 

according to the ergodic achievable rate expressions in (|2^-([^, while the numerical curves 


are plotted based on Monte-Carlo simulations. Clearly, there is an exact agreement between 
analytical and numerical results, thereby validating the correctness of the analytical expressions. 
At low SNRs, RTRI have negligible impact on the achievable rates for all receivers considered 
herein, while at high SNRs, RTRI decrease the achievable rates of ZF and MMSE receivers and 
generate finite achievable rate ceilings; however, RTRI do not have substantial impact on the 
performance of MRC receivers in this regime, due to the dominating inter-stream interference. 
As anticipated, MMSE receivers perform the best across the whole SNR regime. In contrast 
to ideal hardware systems, we can see that the performance gap between optimal receivers and 
MMSE/ZF receivers in hardware-impaired systems diminishes as SNR increases. In other words, 
for hardware-impaired systems, MMSE and ZF receivers become near optimal in the high SNR 
regime. 


With the same system setup, we demonstrate the optimal training length T* for systems with 
different linear receivers in Fig. T* values for optimal receivers are drawn according to 0 
Section FV-C]. For low SNR values, considerable amount of training is needed. As we have 
proved in Section 


IV-B 


T* converges to as p ^ 0. As the SNR increases, ideal hardware 


systems require less channel training, until the pilot length reaches the minimum possible value 
T* = Nt. However, for systems with RTRI, the conclusions are different. If ZF or MMSE 
receivers are considered, it is noteworthy that more pilot symbols are needed in the high SNR 
regime; this phenomenon indicates that, although the effective SINR saturates due to RTRI, 
increasing the training length can still provide performance gains by improving the estimation 
accuracy. If MRC receivers are considered, there is no such extra training benefit. This is due 
to the fact that MRC receivers are dominated by inter-stream interference for both ideal and 
hardware-impaired systems, such that, the RTRI distortions have only insignificant impact on 
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the system performanee. As a result, the optimal training length for hardware-impaired systems 
does not deviate mueh from that of ideal hardware systems. Moreover, MMSE and ZF reeeivers 
generally require more training, while MRC reeeivers require less training, in eomparison to 
optimal reeeivers. 



Fig. 3. Ergodic achievable rates with different linear receivers and optimized Tp {Nt = Nr = 4, T = 200). 


V. Extension to Earge-Scale MIMO Systems 


Earge-seale MIMO systems are eonsidered as a key enabler for next generation wireless 
systems, sinee it ean offer huge improvements in throughput, as well as, energy effieieney. 


Moreover, they are very robust against small-seale fading and intentional jamming [311. For 
large-seale MIMO systems with ideal hardware, the training overhead has been analyzed in, for 
example, 0, 0, where it has been shown that the training requirement does not neeessarily 
to be inereased linearly with the number of antennas. However, large-seale MIMO systems 
are expeeted to deploy lower-eost radio frequeney (RE) eomponents, whieh are more prone 
to hardware impairments. Motivated by this important observation, in this seetion, we will 
investigate the impaet of RTRI on large-seale systems. We derive deterministie equivalents of 
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Fig. 4. Optimal training sequence length for linear receivers with various levels of RTRI (Nt = Nr = 4, T = 200). 


the ergodic achievable rates, and thereafter find the optimal training length. We assume that the 
coherence time T is big enough to support our large dimensional analysis. As we will show 
later, our deterministic equivalents are accurate even for small number of antennas; therefore the 


dependence on large T is not essential [ 321. 


A. Deterministic Equivalents 

For each type of linear receiver, we now present the deterministic equivalent approximations 
of the SINK and achievable rates in the large system limit, i.e., for Nt ^ oo with a finite 
ratio ^ e [l,oo)|] 

Theorem 2: As Nt —)■ oo, with a finite ratio /3 = ^ G [1, oo)|^the SINRs of ZF, MRC, and 


®Since the SINRs of each stream are statistically independent, we drop the dependence on k for the sake of clarity. 

^For ZF receivers, we restrict j3 to be in the range (1, oo). For j3 = 1, one can show that H^H has at least one eigenvalue 
converging to zero |33| , which makes the ZF analysis intractable. Due to page limit, we ignore the case of /) = 1, one can find 
a detailed study of this case, for example, in |34|. 


March 31, 2015 


DRAFT 






























20 


MMSE receivers, almost surely converge to the following deterministic equivalents, respectively 


Nr,Nt^OO 

7zf 


Nr,Nt^OO 

/MRC 


A 

7zf — 

7mrc 


/g-1 

6‘^{/3 - 1) + Cl’ 

^^_ 

1 + (52 + 01 +^ 2 / 3 ’ 


Nr,Nt^oo a.s. _ A 

7mmse ^ 7mmse — 


^ + \J(P + 

1 + TT^) 


(34) 

(35) 

(36) 


where + ^ e ^ and d ^ ^ + 1 - /5. 

Proof: See Appendix ■ 

Theorem 1^ provides us with deterministic approximations of the SINRs in the large-antenna 
regime, which do not rely on random channel realizations, as long as the ratio /3 is fixed. The 
above results are quite general and tractable, and as we will show later, are very accurate even 
for small number of antennas. Note that some practical interesting system setups, for instance, 
Nt -C Nr (usually appearing in the multi-user MIMO uplink), can be treated as a special case 
of Theorem In particular, we will give this result in the following corollary. 


Corollary 1: For the case Nt <C Nr, such that /3 —>■ cx), the SINRs of ZF, MRC, and MMSF 
receivers, converge almost surely to the same deterministic equivalent 





(37) 


This corollary is consistent with the asymptotic results in [18|. From the above equation, we 
can see that the SINR reaches a limit when A) <C Nr. This limit, which is only quantified by 


the level of RTRI, is consistent with the outage probability analysis in (18)-(20). In this case, 
increasing the transmit power does not provide any SINR gain. 

After establishing the asymptotic SINRs, we proceed to derive the achievable rates. Using 


the continuous mapping theorem [35 Theorem 2.3] and dominated convergence theorem 
Theorem 16.4], we can show that the achievable rates by different receivers converge to the 
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following deterministic equivalent 


j:^Nr,Nt—^00 


R 



iViloga (1 + 7 ) , 


(38) 


where 7 is found in (34), (35) and (36) for ZF, MRC and MMSE, respectively. Note that one can 


easily prove that i? is a strictly concave function in Tp [37|. Then, as proved in [38, Theorem 
4], optimizing asymptotically identical to optimizing R. Therefore, solving the 

following optimization problem will return the desired T*: 


maximize R^r,Nt^oo 

subject to Nt <Tp < T. (39) 


With the results in Theorem and (38), we can find T*, for example, via a line search over 
different Tp values (e.g., using the bisection method). As for the special case in Corollary it 
follows readily that the achievable rates of different receivers all converge to 


Rf^- 


R = 


l-^]N,\og, 


1 

1 + ^ 


(40) 


Obviously, in this scenario, T* = Nt channel uses should be devoted for training. Furthermore, 
the number of transmit antennas should not be larger than half of the coherence time, i.e., 
Nt < otherwise, the channel training overhead will cause a decrease in the achievable rate. 


B. Numerical Illustrations 

Figure compares the deterministic equivalents of the ergodic achievable rates with the 
ergodic achievable rates we derived in the previous section . For MMSE receivers, we see an 
excellent match in all cases, even for low dimensional systems. Eor small number of antennas, 
the achievable rate approximations of ZE receivers at low SNRs and of MRC receivers at high 
SNRs have moderate deviations from the true values, while these deviations vanish quickly as 
Nr increases. Comparing these four plots, we can conclude that, for high SNR values, RTRI 
always make the convergence to the asymptotic expressions faster, whilst for low SNRs, RTRI 
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slow down the convergence. We can also see that, increasing the ratio /? always guarantees better 
convergence. 

In Fig. we illustrate the optimal training length T* in the large-antenna regime. We note 
that in the upper subplot, where the system is “not so massive”, RTRI still create extra training 
requirements in the high SNR regime. However, when the system is equipped with large receive 
antenna arrays, the optimal training length is reduced, and the optimal T* may be even less than 
those of ideal hardware systems. The reason behind this phenomenon is that the improvement 
provided by estimation saturates very fast, and we can still have certain gain by allocating more 
time for data transmission. 


p = OdB, 13 = 2 p = 30dB, 13 = 2 




-MRC 

.MMSE 

p = 0dB,/3 = 8 ^ = 0.05 p = 30dB,/3 = 8 




Fig. 5. Deviation of the achievable rate deterministic equivalents from the ergodic achievable rates (T = 500). 
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Fig. 6. Optimal training length in the asymptotic regime (T = 500). 


VI. Conclusions 

In this paper, we analyzed the performance of training-based MIMO systems with residual 
hardware impairments. We derived a LMMSE channel estimator for the proposed model, and 
deduced the irreducible estimation error floor in the high SNR regime. We derived SINK 
distributions for ZF, MRC and MMSE linear receivers, and found the corresponding closed-form 
ergodic achievable rates. We observed that the achievable rates saturated in the high SNR regime. 
The optimal training lengths were thereafter found through optimizing the ergodic achievable 
rates. Generally, RTRI imposed more training requirement at high SNRs for MMSE and ZF 
receivers. However, for MRC receivers, there was no difference in the optimal training length as 
compared to ideal hardware systems. Moreover, for large-scale systems, we derived deterministic 
equivalents of the ergodic achievable rates, which were shown to be accurate even for small 
number of antennas. In this case, we found that by deploying large receive antenna arrays, the 
extra training requirement imposed by RTRI was alleviated; with sufficiently large number of 
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receive antennas, systems with RTRI needed even less training than ideal hardware systems. The 
reason behind this phenomenon is that, the improvement offered by channel estimation saturates 
very fast, while we can still have performance gain by allocating more channel uses to data 
transmission. Thus, it is important to find a good tradeoff between channel estimation and data 
transmission. 


Appendix A 

Useful random matrix theory results 

Lemma 1: (Matrix inversion lemma Let A G be a Hermitian invertible matrix, 

and let x G and r be arbitrary vector and scalar, respectively. Then, we have 

x^A-i 


x(A + rxx 


H\-l 


1 + rx^A~^x’ 


(41) 


Lemma 2: (Trace lemma [39 Lemma B.26]) Let A G and x, y ~ C7V(0, ^Itv)- 


Assume that A has uniformly bounded spectral norm with respect to N, and that x and y are 
mutually independent, and independent of A. Then, as —)■ cxo, we have 


x"Ax--tr(A) 


x^Ay 


0 


0 . 


(42) 

(43) 


Lemma 3: (Rank-1 perturbation lemma [40]) Let z < 0, A, B G with B Hermitian 

nonnegative definite, and v G C'^, then 

' |A| I 2 


|tr ((B - zl^)-i - (B + vv^ - zItv)"^) A| < 


(44) 


Lemma 4: Let A G denote as the eigenvalue CDF of A^A, and as 


the Stieltjes transform of ^ . Then, for z G 


n 


—mp^Hj,(z)=mpj,^H(z)+ ^ ^ 


we have [41 Lemma 3.1] 
— n 1 


(45) 


The notation 2 € C\R^ denotes all 2 € C, 2 ^ ' 
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Appendix B 

CDF OF THE SINK WITH ZF RECEIVERS 


Denoting x = 1/ (H^H) ^ 


k^k 


, we know from [29| that x follows a chi-square distribution 


with 2{Nj. — Nt + 1) degrees of freedom and is scaled by 1/2, with its CDF given by 

Nr-Nt fc 

Fx{x) = l-e-^ ?7- (46) 

Rewrite the SINK of ZF receiver as 


k\ 


k=0 


X 


7zf — 


M + Co’ 


then we finally have 


— Pr(rzF <l) — Fx 


Col 


1 — ( 5^7 


(47) 


(48) 


CDF OE THE SINK WITH MRC RECEIVERS 


First, we rewrite 7mrc as 


7mrc — 




Nt 


i=l,i^Nt ^ 


+ hfc + Co 


(49) 


{l + 5^)Y + 5^Z + co' 

where Z is chi-square distributed with 2Nr degrees of freedom, and Y is chi-square distributed 


with 2{Nt — 1) degrees of freedom with PDF [42| 


friy) = 


e ^ 
(iVt- 2 )!- 


(50) 
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We can now express the CDF of the SINK with MRC as 


-^rMRc( 7 ) - Pr(rMRC < 7) 

'(1 + 5^)7 , Co7 


(51) 


Fy 


y + 


1 — (5^7 1 — 5^7 


fY{y)dy 


= 1 - 




A^r-l I 1 k , .2\ P ^00 

\ 1 - 52 ^ j ^ fk\ f 1 + 5^^ 


(Nt - 2) 


fc =0 


E 

p=0 


k\ \p 


Co 


1+7 


y 


■Nt+p-2^ Y^^ydy (52) 


= 1- 

f AT,-9)1 


—an Af ,-1 f C 07 \ k /,x , . 2 \p 


W-2)! 7; « “VP 


E 


A:=0 


p=0 


k\ fl + 6^Y {Nt+p- 2 )\ 


Co 


1 +^ 

1 — 5^7 J 


= 1 - 


AT^-l k {Nt+p-2\ , r2 \ P r ^O'y ^ 

^ V P j +5^V ',l-<5+y 

iVt-1 ,Z^ ,Z^ f^ _ 


1+7 A 

1 — 5^7 J 


,.o 

■v* 


1+7 

1—5^2' 


P • 


— ^ p,k 


The integral in 52 can be evaluated via the identity Eq. (3.381.4)]. 


(53) 


(54) 


CDF OF THE SINR WITH MMSE RECEIVERS 


We start from the equivalent form in (30), and have 

W 

7mmse — 


where W = ]Y [Y^^=i i^k^i^f + i^Iw) is a classical MMSE SINR expression, with 


l + d^ + d^W 


-1 


(55) 


CDF given by [ [43] Eq. 11] and Eq. 8] 

Fw{w) = 1 — e 1 +^ y 

where 


Nr —1 A f \ / ^ ^ 

cnx __ Ak\w) ( C^W 


k =0 


k\ V1 + 


(56) 


Akiw) + 


1, if Nr > Nt + k, 

Nr - k -1 

E 

i=l 


1 + "e" 


otherwise. 


Using a similar approach as in the above proofs, followed by some simple manipulations, we 
obtain the final result. 
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Appendix C 
Proof of Theorem [I] 

A. Achievable rate of ZF receivers 

Firstly, we can easily prove that the SINK should be in the range (0, p), which also holds 
for MRC and MMSE receivers. Therefore, substituting ([T8]) into (22), we have 


i?ZF — 


TdNt 

ln(2)T 


1 —sn Nr-Nt ( ^ 0 '^ 

g 1 -<S 27 ^ 1-527 


1 + 7 


fc =0 


k\ 


-d'y 


(a) TdNt 


Nr-Nt 




^ ml 

i>—r\ ^ U 


e 


(1 + x){6'^ + (5^ + l)x) 


dx 


[b) TdNt 


Nr-Nt 


Co 


ln(2)r \P 


fOO ^—x^k /•oo ^—x^k 

-dx — / - -dx 


'0 


X + 


52 

52+1 


'0 


X + 1 


(57) 

(58) 

(59) 


where (a) is obtained by a change of variables —>■ x and exchanging the order of integration 


and summation, and ( 6 ) is obtained by expanding the integrand in (58). Finally, applying [28 
Eq. (3.383.10)] in (59) and the fact that En{z) = — n,z), where T{n,z) is the upper 


incomplete gamma function [ |28| Eq. (8.350.2)], concludes the proof. The achievable rate of 
MMSE receivers can be deduced in a similar way. 


B. Achievable rate of MRC receivers 


Substituting the SINR with MRC in (18) into (22), and applying the change of variables 
—>■ X, as well as changing the order of summation and integration, we have 


, TM 

' k=0 p=0 


Co 


P+M 


1 


e ^x^ 


r + ij 7+f) 7+ *)’■+"■ 

Note that for any a, 6 G C, a 7 ^ 6 , we can easily prove the following identity: 


dx. 


(60) 


{x + a)(x + by 


{b — a) " (6 — a)' 

X + a ^ ix + b) 
1=1 ^ ^ 


'\n—2+1 


(61) 


Applying this result to ( [60] ), and using the integral identity in [ |44] Eq. (2.3.6.9)] followed by 
some algebraic manipulations, we arrive at the desired result. 
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Appendix D 
Proof of Theorem [2] 

A. Deterministic equivalent of ZF SINK 

The singular value deeomposition (SVD) of H is denoted by H = USV^, where U and V 
are unitary matriees, while S is a x A* diagonal matrix with the diagonal elements eontaining 
the singular values of H. Consequently, we have 


H^H) ^ = [VAV^l, , = vf Avfc ^ —tr (A) ^ 

^ \k,k ^ ^ (n^ A ^ ' 


(a) Nf 


Nr-Nf 


(62) 


t-tX -1 


where A is a diagonal matrix with its diagonal elements being the eigenvalues of (H^H) 
Note that is known to satisfy the eonditions of x in Lemma [34|, therefore (a) follows 
readily. Substituting the above result into the ZF SINK expressions in ([T5]) completes the proof. 


B. Deterministic equivalent of MRC SINK 
Rewriting the SINK of MRC, we have 


7fc,MRC — 




((1 + + CoU) K 

h^hk '2 


(1 + h,hf + h, + <52 I 

a.s. 1 

a.s, /? 


(b) 1 + 52 +Cl+ ,52^’ 

where (a) and (6) are obtained by applying Lemma 


C. Deterministic equivalent of MMSE SINK 


First, rewrite the MMSE SINR expression in (30) as 


7A:,MMSE 

7A:,MMSE — . , „ , .O'-’ 

1 + <)2 + ()27fc MMSE 


(63) 

(64) 

(65) 

( 66 ) 


(67) 
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where 7fc,MMSE = hf ) h^.. As Nr, Nt ^ oo with /3 = ^ e [1, oo), 

we have 


-1 


Co 


7A:,MMSE -^ I §2 ^ — tr I I HH 

(a) 1 + 


H 


Co 


1 + 52 


Nr 


-V 


( 68 ) 




Co 


Nt 


- 1 ' 




1+52 


Co 

a.s. 1 + 52 


{Nr - Nt) + 


rrco(l + hf (HH^ + ^I^^)-^h, 

(1 + S^)Nt 


(69) 


w Co ' ■ ■' Co(l + mir^^_(j^))’ 

where -Fg^H is th^ eigenvalue CDF of HH^, and (t^) Stieltjes transform of 

FgHfl with argument [411, (a) is obtained by using Lemma(6 ) is obtained by using the 
matrix inversion identity ( |4T] ) and Lemma while (c) is obtained by using Lemma 
Combining ([68]) and (|69|), we ean easily obtain 




(70) 


Substituting this result into (67) and applying the eontinuous mapping theorem [35 Theorem 
2.3] eoneludes the proof. 
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